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I Abstract 

^ ' In testing the independence of two Gaussian populations, one com- 

putes the distribution of the sample canonical correlation coefficients, 
given that the actual correlation is zero. The "Laplace transform" 
of this distribution is not only an integral over the Grassmannian 
of p-dimensional planes in complex n-space, but is also related to a 
OO ■ generalized hypergeometric function. Such integrals are solutions of 

! Painleve-like equations. They also have expansions, related to ran- 

dom words of length £ formed with an alphabet of p letters. Given 
that each letter appears in the word, the maximal length of the dis- 
joint union of p increasing subsequences of the word clearly equals 
1^ I i. But the maximal length of the disjoint union of p — 1 increasing 

a • subsequences leads to a non-trivial distribution. It is precisely this 

^ . probability which appears in the expansion above. 
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Random words, longest increasing sequences and mean hook lengths 

Consider the set of words 

71 & := {words vr of length i, built from an alphabet {1, ...,p}} , 
with the uniform probability distribution 

P''^{n) = i (0.0.1) 
pi 

The RSK correspondence (see section 2.1) between words and pairs of semi- 
standard and standard tableaux induces a probability measure on partitions 

XeYe = {partitions A G Y of weight |A| = i}, (0.0.2) 

given by 

P^'P(X) = ll^^j^^ |A|=£, (0.0.3) 

where sx is the Schur polynomial associated with the partition A, 

p 

P = (1, . . . , 1, 0, 0, . . . ) and /'^ = ^{standard tableaux of shape A}, 

(0.0.4) 

withQ 

( support P^'P) C yJ^^= {A G Y^, such that \J <p}. 

A subsequence a of the word vr is weakly k -increasing, if it can be written as 

(T = 0-1 U 0-2 U . . . U o-fc, (0.0.5) 

where o-j are disjoint weakly increasing subsequences of the word vr, i.e., 
possibly with repetitions. The length of the longest increasing/decreasing 
subsequences is closely related to the shape of the associated partition, via 
the RSK correspondence : 

^ /\ _ f length of the longest strictly 1 _ 

decreasing subsequence of vr J ^ 

^ f\ _ f length of the longest weakly \ — \ \ 
\ /c-increasing subsequence of vr J ^ ' ' ' ^ 

(0.0.6) 

^A^ is the dual partition, i.e., obtained by flipping the Young diagram A about its 
diagonal. So, A^ is the length of the first column of A. 
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For integer < p < n, consider the fixed rectangular Young diagram of 
width n—p, 

V 

II — {n — pY :— {n — p,n — p, . . . ,n — p). (0.0.7) 

Consider a word tt e S^. Then the statement di{Ti) = p imphes, in par- 
ticular, that all letters of the alphabet {1, ...,p} are represented in tt; then 

automatically ip(7r) = £. The theorem below deals with the first non-trivial 
quantity ip_i(7r), given that dii^n) — p . Using the standard notation, defined 
for a general parameter /3 > 0, 

(«)a := Y[{a + P{1 - i))x„with :^ x{x + 1) . . . {x + n - 1), = 1, 

(0.0.8) 

we now state Theorem 0.1, which will be established in section 3.2 (note that 
here symbol (0.0.8) is used for (3 — 1): 



n—q q—p 

Theorem 0.1 Given the probability (0.0.1) and (0.0.3) , the following holds 
(h'^ denotes the product of hook lengths over all boxes of the partition n): 
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(0<p<n) 

V I h-ii^) <i-n + pj 



More generally, for fixed p < q < n, the mathematical expectation (with 
regard to P^'P) of the hook length of X, emanating from the vertical strip v 
( of width q — p), equals: 



(^) n 



n—q<.j<.n—p 



^! A (g - i) 



n 



E 



1 



kJ <p 



(0.0.10) 



Generating function for the mathematical expectation of the hook 
length, integrals over Grassmannians and Painleve V 



Theorem 0.2 below involves an integral over the Grassmannian 

U{n) 



Gr{p, C") 



U{p) X U{n - p) 



--: G/K 



(0.0.11) 



of p-dimensional planes in through the origin and Haar measure diJ,{Z) 
on Gr{p, C"), expressed in the parametrizing coordinate Z of 

Affine Grip, C") = jspan ^ ^ Z A2iA-^\ A e G^ (0.0.12) 



where A & G is represented in block form: 



p 


n—p 


> 

An 


< * 

A12 


A21 


A22 



I p 

In-p 



G G. 



(0.0.13) 



Section 1 will be devoted to the geometry of Gr(p, F"), where F = C, 
or H, and to the study of integrals over Gr{p, F"). 
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We also need Jack polynomials, which are the unique symmetric functions 
orthogonal with respect to a certain a-dependent inner-product (,)„, such 
that 

(Jf),j(")) = 5,,jf , (0.0.14) 

with 

^t^= n (Aj-^ + a(A,-j + l)) (Aj-z + l + «(A,-j))- 

Facts about Jack polynomials relevant for this project will be discussed in 
section 2.3. 

Finally, generalized hypergeometric functions 2^1"^ are defined by: {p, q,n & 

C, X = (Xi,X2,...)) 

„; .) := J] <lf^ a<'<^. (0.0.15) 

In particular, 

The following theorem will be established in section 4.2; it is strongly mo- 
tivated by certain integrals appearing in the context of testing statistical 
independence of Gaussian populations, as will be explained in the next para- 
graph. 

Theorem 0.2 For fixed p < q < n/2, the generating function for the math- 
ematical expectation of the hook length (0.0.10) over a strip, with regard to 
the probability (0.0.3), is given b'^ 

^Remember ( support P^'^) C {A G Y, such that |A| = t, Xj < p}. 
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n 



n — 2 ! 



X 



E 

£>p{n—p) 



( 



\ 



^{A2M}(A) n 



3) 



\ 



n — q'Cj^n — p 



J 



JGr(p,C") 



2F[^\p,q-n-y) 
exp 



^ u{y) — p{n — p) + py 



dy 



10 y 

where u{x) is the unique solution to the initial value problem: 
x^u'" + xu" + Qxu^ - Auu' + AQu' - 2Q'u + 2R = 



(0.0.17) 



with u{x) = p{n — p) X + . . . + +a„+ix'^"^"^ + Oi^x""^^) + . . . , near x = 0. 



n 



(Painleve V) 

+ . . . 

(0.0.18) 



with ttn+i specified by the hypergeometric function in (0.0.17) and 

AQ = -x'^ + 2{n + 2{p - q))x - {n - 2py 
2R = p{p — q){x + n — 2p). 

Remark: The constant Cn,g,p in (0.0.17) is the one below for (3 = 1: 



(0.0.19) 



c := c, 



n,q,p • 



n 

i=l 



Tjzp + l)T{p{n -q-i + l))T{(3{q + 
T{(3 + l)T{l3{n-i + l)) 



(0.0.20) 



Testing Statistical Independence of Gaussian Populations 



To summarize section 5, consider p+q normally distributed random variables 
(Xi, XpY and (Yi, YqY {p < l) with mean zero and covariance matrix 
S. According to Hotelling (|l)(see also Muirhead H], pl06), (Xi, ...,Xp)^ 
and (Fi, YqY can be replaced by a linearly transform of the X's and F's, 
so that the covariance matrix takes on the canonical form: 
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E = cov 



X 
Y 



where 



p 1 

i > i 

Ell Si2 

HJ2 ^22 



I P 

u 



y — 

^can — 



Pk 



o 

Pk+1 



o 



\ 



pp 



o 



p, k — rankEi2, 



is the p X g-matrix of canonical correlation coefficients: 

1 > Pi > P2 > ■ ■ ■ > Pfc > 0, Pk+l = ... = Pp = 0. 
The n {n > p + q) independent samples {xn, . . . , xip, yu, . . . , yiq)~^ , ■ ■ ■ , 
{xni, . . . , Xnp, Vni-i ■ ■ ■ , Unqf' i arising from observing ^ y- ^ l^ad to a matrix 

of size ri), having a normal distribution with correlation E. The 



X 



roots rf, . . . ,rp (sample canonical correlation coefficients) of the equation 



det {xy {yy ) yx — r xx ) = 

are the estimators (maximum likelihood estimators) of the canonical corre- 
lation coefficients pi, . . . , p^. 

In testing the null hypothesis, versus the alternative hypothesis, 



Ho: pI 



versus Ha : {p\, . . . ,pj) 7^ 



one needs the joint density of the rf, given that pf = . . . 
up to a (g, p, n) -dependent normalizing constant, the density is given by 



Pp = 0; namely, 



|Ap(r2)|^ f[(r2)/^(«-f+i)-i(l - r2)^(--^-f+i)-idr,2 



(0.0.21) 



1=1 



ior P — 1/2. This formula generahzes to the formula above, upon considering 
random variables (Xi, ...,Xp)~^ and {Yi, ...,Yq)~^ , with values in the complex 
C (/? = 1) and the quaternions H (/? = 2). 
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Expectation of the ratio of Jack polynomials, integrals over Grass- 
mannians, sample canonical correlation coefficients and PDE's 

Consider now a Poissonized probability on partitions A G Y, which depends 
on a parameter x > 0: 

P.,,(A) = e-^^- ^'^'-^f/^^l^^"^ = e-^^-^^^P^'^(A), A G Y, 

(0.0.22) 

with P^'P generalizing probability measure (0.0.3), as we shall see in section 
3.1, 

This probability has its support on A^^ < p. Many of these probability 
distributions on partitions have been introduced and extensively studied by 



Borodin, Kerov, Okounkov, Olshanski and Vershik (see ||T0|, [Tl|, |T^, pl| ) 
The following statement involves an integral suggested again by the statistical 
theory mentioned earlier. 

Theorem 0.3 For fixed p <q< n/2, the following holds ((3 = l/2,l,2j 



n,p,q 



f gX.T.(/+ztz)- det(ZtZ)-^(^-P)rf/i(Z) 

JGr{p,¥") 

J[o,i]p 1 
C2Fi'^''\Pp,Pq;Pn;y)\ _ ^ 

cexp [ v{y)dy. (0.0.23) 
Jo 



where c = (^}q^p is as in (0.0.20) and where 

• dfi{Z) is Haar measure on the space Gr{p, of p-planes in where 
F := C,M or the quaternions H. 
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» The integral over [0,1]^, appearing in (0.0.23), is the "Laplace trans- 
form" of the distribution of the sample canonical correlation coefficients 
(0.0.21). This integral is a Hdnkel determinant for (3 = 1 and a Pfaf- 
fian for (3 = 1/2 and 2. 

» v{y) := v^,p,q{y) and Ip := In^l,q satisfies the differential equation (define 
5i := 1 for (3=1 and := otherwise): 

4 (^yy + 6yy^ + (1 + 5f){2yy + Ay'^vv' + yf^)) - yP^v' + PiV + P2 



( 0, 



for /3 = 1, (Painleve V) 



-X ^-F> — ' M /5 = 1/2, 



16(p + l)(p + 2)" 11 

3^3 Ip-llp+l 

-y 



[ i62p + r /2 



for (3 = 2, 



with Pq quadratic and Pi, P2 linear polynomials in y, with coefficients 
depending on n and 



r = pq, s = n — 2p — 2q. 



(0.0.24) 



This statement will be established in section 4.1 and the differential 
equation part in section 6.1. As a by-product, we show incidentally that 
the multivariate hypergeometric function 2FI^\p, q; n; y) expressed in the 
iti = Efc>i?/^-variables are r-functions for the KP-hierarchy; but also that 
the function 2Pi^\p,q',n;y) properly restricted is a solution of Painleve V. 

). Section 7 gives new 



For related questions, see Orlov and Sherbin ([^, 
differential equations for the spectrum of Wishart matrices and for the sample 
canonical correlations of Gaussian populations. 

Acknowledgment: The authors thank Professors I. Gessel, S. Helgason, 



B. Lian, G. Schwarz, R. Stanley and C.-L. Terng for useful advice, especially 
regarding section 1. 
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1 Integrals over Grassmannians 

Consider the Grassmannian Gr(p, F") of p-planes through the origin in F", 
where F = C, M or EI (= quaternions). Let G be the group of matrices A, 
with entries in F, such that A'^ = A\ withQ A^ := A~^ . Matrices AeG will 
be represented as block matrices 



p n—p 
> < > 



( to P eG. (1.0.1) 

\A2i A22 J In-p ^ ' 

The main statement of this section is theorem 1.1, where it is assumed, 
without loss of generality, that n > 2p. The values of j3 are related to C, M 
and H, as follows: 

Gr(p,C"): p = l 
Grlp,W): P = l/2 
Gr{p,ir): (5 = 2 

The geometry of the symmetric spaces G/ K and K\G/K has been stud- 
ied by Helgason |2^, In his recent Princeton thesis, Duenez (|TB|) has 



systematically studied integrals over symmetric spaces. Explicit information 
on this subject is not readily available in the literature; therefore we explain 
the theory in the Grassmannian case and the useful aspects for our purposes. 

Theorem 1.1 Consider the two parametrization of Gr{p,¥"') 

Affine G'r(p,F") = S^span ^ ^ Z := Z{A) = ^21^^/, A G cj , (1.0.2) 
and its invariant measures dfi{Z). Then, forp <q< n/2, we have 

/ e^S?^»|Ap(^)|2/5nzf''-^+^)-^(l-z,)^('^-^-^'+i)-Mz,. 



L 



Gr(p,F") 



:i.0.3) 



■^Given ai G M, we define for a = oq + ia\ G C, a = aq — ia\ and for a = ao + a\e\ + 
0262 + 0363 e H, a := flo - aiei - 0262 — 0363. 
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This is the "Fourier transform" of the joint density of the sample canonical 
correlations {zi, . . . , Zp) = {rf, ... , Tp), for j3 = 1/2, 1, 2, for the real, complex 
and quaternionic cases, given that the canonical correlation coefficients are 
zero, (see (0.0.21) and section 5) 



Consider the following block matrix 



Ip O 

O -Ig 



Theorem 1.2 An alternative description for Gr{p,¥^) is given by 

Gr{p, F") ~ 5 := |m = AIp^r^-pA'^ Ip,n-p | ^ e c} (1.0.4) 

The n X n matrices M have an — 2p- dimensional eigenspace corresponding 
to the eigenvalue 1, so that M can be decomposed into M — Mq Mi (with 
Ml corresponding to the 1- eigenspace), with Tr{M — Mq) — n — 2p. Then 
we have 

= / e^^^'|Ap(^)|2/3f]^f^-P+^)-^(l - Zif^''-^-P+''>-'dzi. 
J\o,i]p 1 



(1.0.5) 



Theorem 1.3 Considering the parametrization 

TidGr{p,¥'^) = {Z arbitrary {n — p) x p matrix, with Zij e F}, (1.0.6) 
we have 

e-^-^^^^detiZ^ZfPdi^iZ), EcR+ 



I 

with spectrum (ztz)eE 



f |Ap(ii)|2/5f[e-^"'iif"-^+')-'dii,. (1.0.7) 

Jep I 
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For (3 = 1/2 and x = this is an integral of the joint density (Wishart 
density) (see section 5 and Muirhead p. 107) of the eigenvalues ui, . . . ,Up 
of the matrix A = Z'^Z , where Z is a n x p matrix (p < n), with Gaussian 
density centered at and covariance XIp, namely the density 

l<i<n 
l<j<p 



Proposition 1.4 Then 
Gr{p, 
Gr{p, M" 
Gr{p, IT 



U{n) 



U{p) X U{n — p) 

SO{n) 
SO{p) X SO{n-p) 

Sp{n) 



Sp{p) X Sp{n — p) 
and the affine part can be parametrized as follows: 



: G/K 
=: G/K 
■■G/K, 



[1.0.8) 



Affine Gr(p,F") = S^span \ z = Z{A) = A^iA^^ A G g| 



and 



K\G/K = < 



span 



( — tan 6*1 
O 



O 



tan 6„ 



V 



On—2p,p 

with < 6j < n 



[1.0.9) 



TidGr{p,¥^^) = {Z arbitrary {n — p) x p matrix, with Zij G F}. (1.0.10) 
Setting Id = ({5). 
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Setting, for the respective cases of Gr(j),¥"') and Ti(iGr{p,¥^), 



{yi,... ,yp} = spectrum 
{ui,...,Up} :■■ 



spectrum (Z^Z) 



(cos26'i, . . . ,cos26'p 



with —l<yi< 1, and < Ui < oo, Haar measure on Gr(p, F") and 
TidGr{p,¥^) reads, setting k — n — 2p, (Weyl integration formulae^ 



Gr{p,¥-) : = \Ap{y)\'('ll{l - y.f'^^f'-'^l + y,f-'dy,dK 



TidGrip, F") : df,{Z) = \Ap{u) J] uf^'^^-'^^udK, 

1 

leading to the table: 



G/K 


induced measure d/j, on K\G/K 


dji on TjdG/K 


Gr{p, C") 

Gr{p,W) 
Gr{p,W) 


|Ap(?/)in?(i-yi)^^'-'ni + yi)-^% 

Ap(#m(i-y.r+ni+y.)% 


Ap{uf\\{u^^dui 

W{u)\W^uf^~^^ dUi 
\{uYmuT^^du, 



Table 1 

In the other description (1-0.4) of Gr{p,¥"'), given by 



Gr{p,¥") ~ 5 {m - AIp^^-pA-'l, 



p,n—p 



AeG 



an appropriate left action of B E K on A E G, amounting to conjugation in 
S, leads to the matrix in the torus 21 C G, 



[BA)Ip^n_p(BA) ^Ip^n-p — l^{AIp^n-pA ^Ip^n-p)B 



O 



\ 



O In-2p ) 



e 21, 
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with Qp :— diag(^i, ... ,9p). 



Proof of Proposition 1.4-' The p columns of the nxp matrix A i ^ ] , with 



O ^ 

A G G, and O := On-p,p (a zero matrix of size {n—p,p)) span a ]9-dimensional 
plane in F", so that 

Gr{p, F") = jspan A ^ ^ , with A e cj . 
The right action ol B G K on A G G acts on the p-plane A f ^ j as: 



and therefore it has no effect on that plane 

span AB ^ ^ ^ ~ span A ^ ^ ^ Bn = span A ^ ^ 

since multiplication to the right by Bn merely replaces the p columns of 
^ O ) P hnear combination. Then the nxp matrix V :— A ^^^^ 

satisfies 

V^V = (Ip O) ^ ^ = {Ip O) 4 ^ ^ = (1.0.12) 

Conversely, we show that 

span Ai(^^^^ ^ span A2 ^ ^ ^ (1.0.13) 
imphes A^^Ai e K ^ Ki x K2 Indeed, (1.0.13) holds if and only if 

^ O ) ~ ^ O ) ^' ^^^^ invertible p x p matrix g. (1.0.14) 



15 



Adler-vanMoerbekeiGrassmannian October 8, 2001 



§1, p. 16 



Then we prove g E Ki. Indeed, from (1.0.13), the matrices := ^ ^ ^ 

satisfy VW = Ip and so Vi = imphes g'^g = {V2gyV2g — ViVi — Ip. 
Multiplying (1.0.14) to the left with A:^^ yields 



O 



9 
O 



and thus G 3 ^Ai 



g * 
O h 



the fact that the latter matrix is in G, implies * = and g e Ki, h E K2. 
This means that A^^Ai e K and so Gr{p, F") ^ G/K. 

To describe AfRne G'r(p, F"), notice that a plane ^4 ^ ^ ^ for which 

det All 7^ 0, can be expressed as 



G/K 3 span^ ^ ^ ^ = span ^ 



All 
A21 



span 



All 
A21 



All — span 



Z{A) 



where Z{A) :— A2iA]~/ is a (n — p) x p matrix. Also notice that Z{A) is 
unchanged upon multiplying A to the right with B E K. 

The left action oi B E K on A E G has the following effect on 



Affine G/K 
because in 

BA 



O 



Affine G/K : 



Bii O 
O B22 



z 



All 
A21 



B22ZB11 



BiiAii 
B22A21 



;i.o.i5) 



we have Z{BA) = {B22 A2i){Bii An)"^ = B22Z{A)By^. Picking arbitrary 
matrices Bn E Ki, B22 E K2, the (n — p) x p (n > 2p) matrix Z{A) can be 
"diagonalized" , namely 



Z{BA) = B22Z{A)B^^^ 



O 



o \ 



OLr 



o 



;i.o.i6) 
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Now we use the fact that the p x p matrix Z'^Z is "self-adjoint" and 
positive definite^, and so, setting =: — tan6'i, 



{Z{BA)yZ{BA) 



( Oil 



O \ 



at 



O 



O \ 



tan^ Or. 



;i.0.17) 



Therefore, by the left action of K on G, the p-plane in F" can be represented 
by the span of the columns of the following matrix, which by taking the linear 
combination of the columns, each multiplied with cos 6i reads 



/ cos 9i 





/ 






\ 












— tan 6i 












cos 9p 


span 







— tan 




= span 


— sin 9i 







V 


n—2p,p 




) 







— sin 9p 



On- 



n—2p,p 



and so 









1 span 




, < 9i < 




\ On-2p,p / 





Similarly, Z G TjdGr{p,¥^) can be diagonalized by the action (1.0.16) on 
T„Gr(p,F"); i.e.. 



-1 



Z I— > i?22^-Bii 



o \ 



O 



(1.0.18) 



where the Zi are linearized versions of the tan6'j's and where a"*" is a fixed 
Weyl chamber in the Cartan of p. 



^since v^Z'iZv = {Zv^Zv = '^\{Zv)if > 0, for v G FP\0. 
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By the Weyl integration formula, the measure induced on K\G/K, via 
the Haar measure and the embedding G/K ^ G, is given by (in the compact 
case) (see Helgason [^], p. 188) 

dij{H) = Y[ \ sin a{iH)\"''^dH, H G a„ 

aGS+ 

where g = i. + p, with compact real form u = t + p^, p^ = ip and with a* a 
maximal abelian subspace of p, where S+ is the set of roots having positive 
values on the fixed Weyl chamber a"*" of a* and where the root space Qa has 
dimension rria for any restricted root a: 

g^:={X eg \ [H, X] = a{H)X, for all H G a}, 

and we also have the induced measure on Tjd{K\G / K) (|2^, p. 195) 

rf/i = JJ a{HY'-dH, H G a+. 

The roots and multiplicities nia are as follows {n > 2p): 



Gr{p, C") 


Grip, M") 


Gr{p, H") 


a G E+ 


2 


1 


4 


i{ej + Ek) 


2 


1 


4 


i{ej - Ek) 


1 





3 


2iei 


2{n- 2p) 


n — 2p 


4(n - 2p) 





with l<j<k<p, l<i<p, yielding Table 1, upon setting 

H = i{ei,... ,9p),0<9i<Tr, (1.0.19) 

e.g., we check table 1 for Gr{p,IP). Setting k = n — 2p and yj = cos29j, 
which is very natural in view of (1.0.20): 
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for K\G/K: 



I sm{ej - 0k) siniOj + 0k) 1^ Yl I sin 2^jf | sin^ 0j\^^''-^P^d0j 

l<j<k<p j=l 

= n I 9 2^,- - cos 20k) I' JJ(1 - cos' 20^) ( """"^ ^' ) -d COS 2^,- 

l<j<k<p j=l ^ ^ 

= 2-^'(2(^+^)-i)A(y)^ + - y^f'^^^dy, (1.0.20) 



for TidK\G/K: 

d^x ^ \[ a{H)'^'^dH 



a6S+ 



c n {v,-Vk)\v,-rVk)'\[vf^'dv, 

l<j<k<p j=l 



n («i-''3'n(«i)""'>,^ 



= c 

l<i<A;<p j=l 
P 

= c2-PA\u)l[uf+'duj, (1.0.21) 

i=i 

setting Uj = v^. where in the above we made the identification (1.0.6) of 
Proposition 1.3 and put Z in the normal form (1.0.16), so that Z'^Z = 
diag(iii, ... ,Up). 

To describe G/K in a second way, remember g = t + p, with t,p the ± 
eigenspaces of a he algebra involution a. The latter lifts to the group as an 
involution a, which commutes with inversion, i.e., {g'^)~^ = {g~^)'^. Use a to 
define the following embedding 

f.G-^G-.g^ i{g) = g{g'')-\ (1.0.22) 

which induces a natural injective map 

i:G/K^G:g^i{g). 
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Indeed, L{gi) = i{g2) is equivalent to gi{gi) ^ — g2{g2) which amounts to 
92^91 = i92)~^9i = i92^T9i = i92^9iT, meaning that g^^gi E K. 

From the polar decomposition G = KAK, we have that every g E G can 
be decomposed into 

g = k^^A{e)k^\ ki, k2 e K, A{e) e A, 

with e the torus coordinates, such that A{e)A{e') = A{e^e'), A(0) = / and 
^-1(61) = A{-9). Since A = cxp a, with a e ip, we have A(ey = A~^(e). 
From (1.0.22) the torus A embeds into S as follows: 

l{A{9)) = A{9){A''{9))-^ = = ^(2^)- 

Moreover the polar decomposition g = k^^A{9)k2^ yields conjugation in 
S by ki : 

i{g) = ki\k,gk2k^\gn-'ki')k, 

= kT\k,gk2{k^)-\gn-\k^i)-')ki 
= k^' {k^gk2{{kigk2y)-^) ki 
= k^^ b{kigk2)ki 
= k^h{A{9))ki 

= k^^A{29)ki (1.0.23) 
Specializing to the Grassmannian case, we have the involution 
a : G ^ G : g ^ g'^ -.^ Ip,n-p 9 Ip,n-p 



with 



K — {fixed points in G of the involution cr} 
= {g eG such that g Ip^^-p = Ip,n-p 9}- 



Setting 



Op := diag(ei, . . . ,9p), 
we have that a maximal abelian subspace of the Lie algebra is given by 



a{9) 



/ 


Op 


Op 


\ 








o 




-Op 


Op 




\ 




o 


In-2p ) 
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and by exponentiation, we find the torus 



21- { A{e) 



\ 



o 

O In-2p J 



, <9 <27i} e G. 



The spectrum of A{9) is easily seen to be 

\^ , . . . , O , O , . . . , O , i, . . . , -L I. 



[1.0.24) 



n—2p 




To connect with the previous description, given g & G,we pick ki, k2 such 
that kigk2 — A{9) and by the previous discussions, we have 

Z\g)Z{g) = Z^{k,gk2)Z{k^gk2) 
= ZHA{e))Z{A{e)) 
= diag (tan^ Oi, ■ ■ ■ , tan^ 9p). 

The embedding 21 Gr {p, is then given by 

^^Gr{p,W-):A{9)^A{9)( ^^ ) = 

and so the Qi% in the two discussions are identical. ■ 

Proof of Theorems 1.1, 1.2 and 1.3: In order to compute integral (1.0.3), 
recall in the above description of K\G/K, 

H = iiei,... ,ep). 

Since the integrand is invariant under the left action of K on G, which induces 
on Z the map Z — * B22ZBi^, use (1.0.17), from which it follows that, upon 
using tan2 Oi = {'"^le, = TtS' Oi = {1 + i/,)/2,we have 

i-yi ^-yp\ 



Z^'Z = diag(tan^ Oi, . . . , tan^ 6p) = diag ( - 



+ yi 
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and 



/ cos^ ^1 



O \ 



-diag(l + yi,... , 1 + yp) . 



y O cos^ 9p J 

Hence, setting Zi — cos^ Qi in the last identity below, we have yi — Izi — \^ 
using (1.0.11) and picking an appropriate normalizing constant c: 



iGr(p,IF") 



= C 



[-l,l]f 



j=i 



= c / e-^^^|Ap(^)|2m(l-z,)'^("-^'-«+i)-izf^-^+')-^dz,, 

i[0,l]^ 1 

upon setting yi = 2z,i — 1. 

Finally, using the Weyl integration formula (1.0.21) on the tangent space 
and identifying the (2:1, . . . , ;2p) in (1.0.18) with a point in o^, and setting 
= 2;^ > 0, 1 < i < p, we find 



{l-y,f^-^^\l-y^Y-' 



/ 



-xlrZ'^Z 



with spectrum (ztz)^E 



det{Z^ZfPdi^{Z), 



Jep I 



establishing Theorem 1.3. 

Finally, to prove Theorem 1.2, notice that, according to (1.0.23), matrices 
M in 5 can be diagonalized to matrices A{29), with spectrum 

(e^'^\. . . , e2^^^ e-^'^\ e-'^^^ 1^_^). (1.0.25) 

n—2p 

So M decomposes into M = Mq^Mi, with Mi being the n — 2p-dimensional 
eigenspace corresponding to the eigenvalue 1 and so 



^ ^ - ^ l-cos2^^,. Afl-e2^^^)(l-e-2*'^^' 



^ 1 + ~ J-l 1 + cos2^fc ~ J-1 (1 + e2''^0(l + e-2'^*) / + M 



det 



I-Mn 
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2 Jack polynomials 

2.1 Young diagrams and Schur polynomials 

Standard references to this subject are MacDonald, Sagan, Stanley, Stanton 
and White |34, 41, 44, To set the notation, we remind the reader of a 
few basic facts. 



• A A partition oi n = |A| := Ai + ... + Xe (with n = |A| called the 
weight) is represented by a Young diagram \i > \2 > ■■■ > \e > 0. A 
dual Young diagram A""" = (A7 > AJ > ...) is the diagram obtained by 
flipping the diagram A about its diagonal. 

• A semi-standard Young tableau of shape A is an array of positive in- 
tegers Ojj placed at {i,j) in the Young diagram A, which are non- 
decreasing from left to right and strictly increasing from top to bottom. 

• A standard Young tableau of shape A is an array of integers l,...,n 
placed in the Young diagram, which are strictly increasing from left to 
right and from top to bottom. 

• The Schur polynomial sx associated with a Young diagram A is a sym- 
metric function in the variables Xi,X2,..., (finite or infinite), where 
n = |A| and defined by (for notation /'^, see the next point) 

sxixi,X2,...)= ^ JJ Xa,^^. = xi . . .a;„ + ... 

{a^ j} semi-standard (i,j)^A 



The hook length of the i, jth box is defined by hfj := Aj + A^- —i—j + 1. 
Also define 



:= n hi 



rm / 



Arn{rn + Ai — 1, . . . , m + A^ — m 



5 



for m > A7.(2.1.1) 



• The number of standard Young tableaux of a given shape A = (Ai > 
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> A^) is given by 

— 7^{standard tableaux of shape A} 
= lAI! sx(x) ^ 

Efc^i.=0 for i > 2 

= coefficient of X1X2 . . . x„ in sx{x) 



lAI 



^m{m + Ai - 1, ■ ■ ■ ,m + Xr. 

m 

Y[{m + Xi-i)\ 



m] 



for m > Xj . 

(2.1.2) 



The number of semi-standard Young tableaux of a given shape A, with 
numbers 1 to A; for k > 1: 



# 



semi-standard tableaux of shape A 
filled with numbers from 1 to A; 

k 

sa(C^,0,0,...) 

n j -i+k 

( Afc(/c + Ai - 1,... ,k + Xk-k) 



k-l 

n 

i=l 



0, when k < Xj , 



using the fact that 



when k > Xj , 



n 0'-^+^)= 



nUik + X, - 1)\ 



(2.1.3) 



(2.1.4) 
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Robins on- Schensted-Knuth correspondence: Given 

Sn = group of permutations of {1, . . . n} 

~ {words of length n built from the set {1, . . . , fc}} 

the following 1-1 correspondences hold: 

g J {P,Q), where P and Q are two standard Young tableaux 
of same shape A, with |A| = n and taken from {1, . . . , n} 



(P, Q), where P and Q have same shape A, with |A| = 
— >■ ' P is semi- standard, filled with numbers from 1 to k, and 
Q is standard, filled with numbers from 1 to n 

It follows that for given n and k, we have 



Awith 

I A I — n 

Y,f'sx{l') = r (2.1.6) 



Awith 

I A I —n 



Increasing and decreasing sequences 

According to Greene |2^, given a word n E S^, mapped, via the RSK 



correspondence, into (P, Q) of shape A = (Ai, . . . , A^), then for any /c, 
Ai + ... + Afc = 



a7 + ... + a^ 



length of the longest weakly 
/^-increasing subsequence 

length of the longest strictly 
fc-decreasing subsequence 



2.2 Some useful formulae on hook length 

Remembering the notation (0.0.8), we have the following statement: 
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n—p 



Lemma 2.1 Given a partition A ^ /x := (n — pY with Xj = p, < p < n, 
thet^ 



[n 



(p)av 



n — I] 



p-i 



and Sfj_{V) = 1, 



Proof: Setting k = A\/i 



n 4.) 

(«j)G/i 



p n—p 



Y[Y[{n + l + Ki-i-j) 



i=i j=i 
p 



]^(n + Ki - i) . . . {n + I + Ki - i - n + p) 



1=1 



i=l 
P 



{p + 1 + Ki - i) . . .{n + Ki - i) 
{p + 1 — i) . . . {n — i) 

{n-i + 1)^^ 



Here {a)^ := (a)i'^ =]!.(« + 1 - ^)a. ■ 



(2.2.1) 



(2.2.2) 



(2.2.3) 
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h^" = [(n-l)(n-2)...p][(n-2)...(p-l)]...[(n-p)...l] 

(n-l)!(n-2)!...(n-p)! 
(p-l)!(p-2)!...l! 

f[{n-i)\ 



p-i 

1 



(2.2.4) 



Then 



n U-^+P) f[(n-i)\ 
1 

p-1 



= ^^^^^^^^^^ = ^ = 1, using (2.1.4). 



1 



Finally, using in the second identity below formula (2.1.4) and in the 
fourth identity Ylii''^ — i + K,i)\ — Ylii''^ ~ '^V-Ylii''^ — i + 1)^: one computes 

n (j-'^+p) 

1 

p 

Win - i + KiY- 

^ 7TJc:2~ — ^1 ' ^^^^^ Xi = n-p + Ki 

^^! 
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1 

_ ft" ^ ((P- ' + ' W)' , uang (2.2.1) and (2.2.2) 



ending the proof of Lemma 2.1. 

2.3 Jack polynomials 

Define symmetric polynomials 



permutations 

of Xl ) • • • ) 

and the dominance ordering between partitions: 

e e 

H < \ means : ^^A*i < X^'^*' ^ ^' 
1 1 

Given that A has m.i = m,i{\) parts equal to define the inner-product 
( , ) on the vector space of all symmetric functions of bounded degree (to be 
explained) 

(pa,Pm) = <^A^(1"'^2-^ ■ • ■ )mi\m2\ ...a^^. 
Jack polynomials are the unique symmetric functions J^""* satisfying 

(i) (Jf ,4")) = 0,if A^/i, 

(ii) = EM<Af^AM(a)^M> 

(iii) If |A| = n, then 

t(c«) I I 

= nlXi . . .Xn + ■ ■ ■ 
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It follows that 



n (Aj-^ + l + a(A,-j))- 
(j,j)eA 



Special cases: 

Jack polynomials for a = 1 are proportional to Schur polynomials, namely 

Zonal polynomials are given by 

Zf^j^yp)^ with /3 = 1,1/2,2. 

They have the remarkable property that for G = 0{n), U{n) or t/(n, H) := 
{9 I Sf = /}, 



for all 



cr, r G S = {real symmetric matrices} for a = 2 

= { Hermitian matrices} for a — 1 

— {quaternionic matrices, with a — a^} for a — 1/2. 

The function (r) is a symmetric function of the (real) spectrum of r. 



Orthogonality: 



= (2.3.2) 



where 



Jx^ = n (A7-i + a(A,-i + l))(Aj-i + l + a(A,-j)) 

(i>i)eA 



(/l^)2 


for q; 


= 1 




for a 


= 2 


^2AT/22|A| 


for a 


= 1/2 



(2.3.3) 



29 



Adler-van Moerbeke : Grassmannian 



October 8, 2001 §2, p. 30 



Special values: 

For arbitrary n, we have 



H (n-(^-l) + a(j-l)), wherer = (l,... ,1,0,0,...) 

(*j)eA 



n (^("-'+1) 



a 



a/ A 



a 



n T."/!/,. , , , ,^ > , for all m > Xj , if n > Xj , 



\ r(i(n-z + l)) 
, ifn<Xj 



(2.3.4) 



and so, for a = 1, 
1 



= < 



1=1 ^ ' 
, if n< A7. 



> , for all m > A7, if n > A^, 

(2.3.5) 



The last identity in (2.3.4) is obtained by taking the product over the ith 
row of A and using {x + n){x + n — 1) . . . x — When the Gamma 

functions blow up, the formulas must be understood as limits. Also 



Expansion of (xi + 2:2 + . . . )": 



{X1 + X2 + ... )" 



a 



4!^ 

-(a) ■ 

|A|=n 3x 



(2.3.6) 
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Then also 

(Xi + ^2 + ...)" = 

Cauchy identity: 



J2 Ci''\x) , with (x) = ^^Ji-\x). (2.3.7) 

A|=n 



JrWT\y). 



(jj)>i 

in particular, for a = 1 



AeY Jx 



i,j>l AeY 



Hypergeometric functions 



Generalized hypergeometric functions 2-^1"^ are defined by: 
2Fi''\p,q;n;x) := ^ 



KeY Ji^ 



a' 



For a = 1, using ji^^ = h'^s,^ and Jk^'' = {h'^y, we have 



(1) _ {hK\2 



2Fi^\p,q;n;x) := ^ 
and so, upon restriction. 



KeY 



77, 



2Fl^\p,q;n;x) 



Y,i xl=5iiU 



KeY 



(2.3.8) 



(2.3.9) 



(2.3.10) 



(2.3.11) 



(2.3.12) 



Generalized Selberg formula: 

Kaneko computes the following integrals, subjected to the condition that 



a, 6 > /3(p - 1) (see also MacDonald H and Kadell |2|): 
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f 1 28 ^ 



p 



T{tp + 1) r(a + p{l - i)) T{\i + b + Pil- i)) 



i=l 



T{(3 + 1) r{Xi + a + b + f5{l - i)) 



(2.3.13) 



Setting u = {ui, . . . ,Um), we have the following representation in terms of 
the hypergeometric function: 

l<k<m 

- 2F, iPp,b,a + b,u)[[ r(/5 + l) r(a + 6 + /3(l-.)) " 



(2.3.14) 



3 Probability measures on partitions 



3.1 Probability measure on the set Y of all partitions 

In view of formula (2.3.8), define the (not necessarily positive) probability 
measure on the space Y of Young diagrams, depending on x,y and a: (see 



10, 11, 12, BO, m 



P^^)__^Wa%L, AgY. (3.1.1) 

In particular, evaluating P(A) along the locus 

for X = (xi, X2, . . . ) such that ^^x^ = Suu, we have J^\x) = m''^' 



e 

p 



for y = (1, 1, . . . , r, 0, 0, . . . ) = F, we have j1"^(F) as in (2.3.4), 

(3.1.2) 
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and usingQ 



i,j>i j=i i>i 



i>l 



we obtain the genuine (> 0) probability measure for m > on the space Y, 
depending on m G M and the integer p > 0, 



pu/a„,\\\ J\ ^(1^) 



U 



■(") 



-pu/a iPu/^y^^ pe,p^^^^ with i:= |A|, (3.1.3) 
|A|! 



with 



(support Pu,p) C Y^^*) := {A G Y such that Xj <p}. 
Probabihty (3.1.3) can be viewed as a Poissonized probabihty of 



pe,P^X) 



3[°'\p/ay 



for A G Y^f\ 



Probabihty P„,p(A) is > 0, because, from (2.3.4), 4°^(1^) > 0, for p > Xj 
and = otherwise. 

In particular, setting a = 1 and using J^^ = h-'^sx and j^^^ = {h^Y, 
(3.1.3) leads to 



(3.1.4) 



Pu,p{X) 



a=l 



^puf^X 



|A|! 



P^'P(A) 



where by (2.1.2) 



P^'P(A) 



/"sa(F^ 



a=l 



P 



A G Yf . 



A G Y^P), 
(3.1.5) 



(3.1.6) 



Probability (3.1.6) will be considered next. 

j CO £ 



i>l 



i=l i>l 
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3.2 Probability measures on the set of partitions of 
i and random words 



From (3.1.6), setting in this section P^'^ := P^'^ 

rip) 

, iUl /\ C II 

-\ih)' r>^i by (2.1.2) and (2.1.3), 



pAp(A) = ' ■2)^ \ for AG Y 



f ^a(F) 

= probability on Young diagrams A G Y^^^ coming from the 
uniform distribution on S^, via the RSK correspondence, 

(3.2.1) 

where h := {hi, hp) with hi := p + Xi — i, and where 

5*^ := {words of length £, built from an alphabet {1, , 

with \Sf \ = p^. As already pointed out, this is a probability, firstly because 
of the connection with the word problem, secondly because of (3.1.5). As 
already pointed out, this probability was considered in |Ty, |Tl], |I2|, |31| 
and also in the context of random words, by Tracy and Widom (|[48||). 

Proposition 3.1 Given rectangular Young diagrams fi = {n — pY ^ = 
{n — qY , with p < q < n, and I > p{n — p), the expectation equals 



^! A (g-^)! V 1 fr {p-t + lU{q-t + l).^ 



Proof: From (2.2.1) with A i— > A\/i', (so n q, p p) we have, upon setting 

K = A\/i, 



34 



Adler-van Moerbeke : Grassmannian 



October 8, 2001 §4, p. 35 



Then combining (3.2.3) and (2.2.3), 



n 

1=1 



{q-i + l)^^{p-i + l)^^ 



(n-i + 1)^, 



In particular, setting X = n, and using (2.2.2), 



A=/i 



/i^^/^' _ A {q-i)\ 
{n-i)V 



and therefore 

sa(F)/i^ 



A=/i 



(/.AV)2 11 



(n - i + 1)«, 



(g-i)! 1 -pr (p-i + l)«,(g-i + l)«, 
,=1 (n-i)!(/i^\'^)2J-L 



(n - i + 1)„. 



(3.2.4) 



Then, taking the expectation E^'P with regard to the probabihty measure 
P^'f, defined in (3.2.1), 



E^'^ (/{A5.}(A) n 4.)) 

\ n-q<j<n-p J 



E 



, using /^=^ as in (2.1.2), 



^!-rT (g-0! v 1 -TT 



^ (p - i + l)(AV).(g - i + l)(AV)i 



i=l 



(n-i + l)(AV). 

using (3.2.4) 



^!-rr (g-0! 



^ — p(n — p) 

k7<p 



11 



(n - i + 1)«. 
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4 Expressing integrals as mathematical ex- 
pectation on partitions and on random words 

4.1 Expressing an integral on Gr{p,¥'^) as a mathemat- 
ical expectation on partitions 

Remembering the probability (3.1.3) 

P.,,(A) = e-^^^ ^j^, AeY, (4.1.1) 

Jx 

on partitions Y, with support on Xj < p, the following statement holds: 
Theorem 4.1 For fixed p < q < n/2, the following holds ((3 = l/2,l,2j 

/ e^^(^+^'^)"det(ZtZ)-^(«-^')ci/x(Z) (4.1.2) 
= 2F[''^\(3p,(5q-(3n-y) (4.1.3) 

where c := Cn^g,p «s as «ra (0.0.20). 

Proof: For a symmetric function /(-Zi, ... , Zp), define the integral, depending 
on 

(/) := / f{z,,...,z,)\^{z)\'^\{zf^'^-'^'^-\l-z,f^--^-^^^^^^ 

(4.1.5) 

Kaneko's formula (2.3.13) will be used for a = I3{n — q) and h = I3q 
in the sequence of identities below; the inequalities p < q < n/2 imply 
q — p + 1, n — p — q + 1 > 1, and so the integral (4.1.5) above makes sense, 
and we first apply (1.0.3) in the following sequence of identities: 
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P 



p \ 



1 



\ / ^ 

= E ^^^) ' ^'^S (2.3.6) 

\^ \\\=e Jx I p 

- 11 r(/3 + i) 



using Kaneko's formula (2.3.13), 

^r(z/3+l)r(/3(n-g-z + l)) . 



n 



r(/3 + i) 



i=l 

1 ^ rr\ _i_ /^/' --Li^^ using the definition 

y J-P.p(A) n , ;r ~ ' . (4.1.1) of probability, 

p/3p^^rP (\\^!a!!1!^^-L_ using formula (2.3.4) and 
^ /?l'l4'^''^(l")' ^^^^^ of c^i^" 

yielding (4.1.4). 

Finally, looking at the expression to the right of =, we find, using (3.1.2) 
and (2.3.4), to be precise, 
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the following 



n 



, by (2.3.4), 
, by (2.3.10), 



thus ending the proof of Theorem 4.1. 



Remark: Identity (4.1.3) is also an immediate consequence of Kaneko's for- 
mula. 



4.2 Expressing an integral over Gr{p, C") as a mathe- 
matical expectation on partitions and random words 

We now specialize the previous section to F = C. For fixed integer p > 1 
and P — I, recall the probability (3.1.5), with support in Y^*'^ 



P^.W - - e--^:^P^-(A) , A e Y(-), (4.2.1) 

and the probability (3.1.6) on Y^^'* coming from the uniform distribution on 
Sf, via the RSK correspondence. 



P 



(4.2.2) 



For integer < p < n/2, consider the fixed rectangular Young diagram 
l_i^(n- py. 
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Theorem 4.2 For fixed p < q < n/2, 

I gxTV(/+ztz)-i det(ZtZ)-(<?-f)ci//(Z) 

iGr(p,C") 



'-'n,q,p 
^(n-p)p 



n — q<.j<n- 



= < 



E 



{px) 



t>p{n—p) 



^{A3M}(A) n ^ 

V 



A 



\ 



3) 



71— q<j<.n—p 



,p 



1 ip)n{q)K 



kJ <p 



Cnl,p 2Fl^\p,q;n;y) 



(4.2.3) 



In particular, for p — q, the integral above has two different formulations, as 
a probability or as a generating function of probabilities. 



JGr(p,C") 



{pxY 



-^n,p,p 



E 



xiri--p)P ^ 11 
i>p{n—p) 

^n,p,p X (px) 
r^{n-p)p 2-^ £\ 

£>p(n—p) 



P''" (AD//), 



71 eS^ 



di{7[) = p and 
V-i(^) < £ - n + pj ' 



1 ((P).) 



-n,p,p^- ^ r^^y 



2Fi^\p,p;n;y) 
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(4.2.4) 

where P^'^ denotes the uniform distribution on the set of words of length 
£ from an alphabet 1, . . . ,p, and 

di{Tr) — length of longest strictly decreasing subsequence 

ifc(7r) = length of the longest union of k disjoint weakly increasing subsequences. 

Remark: The constant Cn,q,p in (4.2.2) and (4.2.3) is tiie same as cij^l^p for 
P — 1 (see (0.0.20)) and Cn^q,p is a new constant: 

(1) -uf ^--^ 



nf . ) Cn,q,,^i[i\{n-q-i)\. (4.2.5) 

From the fourth expression of (4.2.2), it follows readily that, near a; = 0, 
(cW ,)-^ / e^iv(.+zt^)-i det(ZtZ)-(«--)d/.(Z) 

JGr{p,C^) 

, PQ /(p+i)(9+i) , (p-l)(9-l)^ 2 



= l + ^x + ^ r" ' + \ + (4.2.6) 

n 4n\n + l n-1 J ^ ' 

Proof: From the last identity in (2.1.2) with m = p, it follows that, since 

^^^/r(i+A^ .427) 

i=l ^ ' 

For a partition A such that \[ <p and for A' = A + A;^, with arbitrary integer 
fc > 0, we have s\{y) = sx'{V), using the last identity (2.1.3). Using these 
facts, we have, continuing from (4.1.6), 

r g.iv(7+ztz)-i aet{Z^Zy-^dix{Z) 

JGr(j>,C") 

r ^ 

- ~ ST -^^^^^^ yr {Q + ^-^V- by the 4th identity of (4.1.6) and 

- ^-'^'^Z.^ ~^ll(n + A,-i)!' J« = and = (/.^)2 

= gn..,pE^'";^ /,A+(n-p). ' using (4.2.7) 



AeY 
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, |y|_(„_,), 5v(F) h>^'-^--'iy setting y = A + (n - pf 

-'n,q,v 2^-^ ^A' ^\'-(n-v)v^ and using Sa(P) = S^' (F) 



A'eY 



C P- 



E(\')l!——^ using P^,p defined in (4.2.1), 

^, -.- . ^"^'P^^^h^'-in-py^ ' and |A'| = |A| + (n-p)p, 



.px 



(4.2.8) 



= &^- Ua3.(a) n "1,. ■ 

\ n-q<j<n-p / 

The last equality in (4.2.3) follows from (4.1.3), while the second to the last 
one follows from (2.3.12). 

Finally, to prove the second formula on the right hand side of (4.2.3), 
start with equality = in (4.2.8), omitting c„_g_p, and replacing A' by A, 



— — r ^x''^' "'^y. ' rr, for flxed ii^{n — -pf 



j^\-{n-p)P ■ 

1 ^{pxp\X\\sx{l^) 

n—p) Z-^ 



ASY 
ADfi 



^p(n-p) U|! f^X p\X\ 11 'Hi,J) 

ASY I I ^ ii,j)eX 

AD/i n—q<j<.n—p 



n 



(n-p) 2^ P\ 2-^ 



{pxY ^ /\sa(1") 



xPin-p) US ^ n ^(M)' using (2.1.2), 

XDfi 71— q<j<n—p 



— y 

(n—p) Z_-^ 



{pxY 

j.p{n-p) 2-^ £\ 
£>p(n—p) 



( 



\ 



E 



\ 



(»,j)eA 

n—q<j<.n—p 



I 



In particular, setting q — p, the latter equals 

£>p(n— p) 
(n—p) Z-^ 



rj.p(n-p) £1 
^>p(n— p) 



-P^'^(A D A7 = p) 



^ E ^P'''{^^S^t ip-i(7r)<£-n + p). 



2;p(n-p) £1 
^>p(n— p) 
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To see the last two equalities, one proceeds as follows. From A ^ /i = 
{n — pY and P^'^(A) = for > p, it follows that Xj = p and by Greene's 
theorem (see Sagan p. 110), di{7i) = Xj = p. Since also by Greene, 
^A:(7r) = Yli -^j' ^^^^ particular ip{7i) = Yli = ^5 have 

Xp = ipirr) - ip_i(7r) = £ - 2p_i(7r). 

From A 3 yU = (n — p)^, it also follows that Xp > n — p, and thus ip-i(7r) < 
£ — n + p. Conversely, if di{n) = Xj = p, and ip„i(7r) < i — n + p, then 
^^^""^ Ai = ip-i(7r) < Aj — + p; hence Xp > n — p, and so A 3 yU. ■ 



5 Testing Statistical Independence of Gaus- 
sian Populations 

The statistical facts, used in this paper and summarized in this section, are 



due to James [25| and Constantine 13]; see also Muirhead [35 



5.1 The Wishart distribution 

Let the pxn matrix X, with n > p and n identically distributed independent 
columns, have the normal distribution 



(det 27rS)-"/2e-5 



TrS-i(X-J\/)(X-A/)^ 



(5.1.1) 



Then the p x p matrix S = X has the non-central Wishart distibution 
with n degrees of freedom, with px p covariance matrix E and non-centrality 
matrix Q = M~^T,~^ , namely 

rp(n/2)-i(det2S)-"/2e-T'^(^+^^''^)(det5)^(«"P-i)oi^i ^^-'^S 

where S > 0, where F^ is the multivariate Gamma function and whereQ(for 
the definition of C\, see (2.3.7)) 



^ (i)JA|! ^ 



AgY 



AeY 



^Here {a)x := n»(«+/3(l-*))A, for = 1/2, with (x)„ := x{x+l) . . . (x+n-l), xq = 1. 
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When M = 0, we find the (central) Wishart distribution PFp(n, S), with 
p < n, for the p x p matrix S = X X^: 

rp(n/2)-i(det2E)-"/V5T'-s-is^^g^^^i(n-p-i) "Q ^g^^^) 

5.2 The canonical correlation coefficients 

In testing the statistical independence of two Gaussian populations, one needs 
to know the distribution of canonical correlation coefficients. To set up the 
problem, consider p + q normally distributed random variables (Xi, ...,Xp)''' 
and (Yi, Yq)^ {p < q) with mean zero and covariance matrix 



eov ^ V_v_^Sn J:u \ Ip 



Y J \ / I q 

The method proposed by Hotelling EM is to find linear transformations U 



LiX and V = L2Y of X and Y having the property that the correlation 
between the first components Ui and Vi of the vectors U and V is maximal 
subject to the condition that Var Ui = Var Vi = 1; moreover, one requires 
the second components U2 and V2 to have maximal correlation subjected to 

(i) Vart/2 = Varl^2 = 1 

(ii) U2 and V2 are uncorrected with both Ui and Vi, 
etc . . . 

Then there exist Op E 0{p), Og E 0{q) such that 

Si/^^Sl2S22^^^ = OpPOq 

where P has the following form: 
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P 



Pk 



O 



Pk+l 



i y 



o 



o 



\ 



Pp 



p, k = rank S 



12, 



1 > Pi > P2 > • • • > Pfc > 0, pk+1 = ... = Pp = (canonical correlation coefficients), 

Pi are solutions (> 0) of det(S^/Si2S22^S72 — p^I) = 0. 
Then the covariance matrix of the vectors 



U = LiX := Op^^l^^X and V = L^Y := O^T.^^ 
has the canonical form (det Scan = Hill ~ Pi )) 

U ' 



1/2 



Y 



cov 



V 



with 

spectrum 
and inverse 



1,... ,1, l-pi,l + pi, 

q-p 



1 



1 - Pp, 1 + Pp 



m(i-p?)H-^' i< 



-p 



5.3 Distribution of the sample canonical correlations 



From here on, we may take S = Scan- The n {n > p + q) independent 
samples (xii, ... ,xip,?/ii, ... . . 

iner from observing ( ^ , | lead to a matrix 

the normal distribution (p. 79 and p. 539) 



[•^nl, • • • 5 ■^npi Vnlj 



, Vnq) , aris- 
of size {p + q,n), having 
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(2^)-n(.+.)/2(detE)-/^exp -^Tr [x^ y^) ( ) ' ( ^ ) 

= (27r)""(*'+^)/^(det E)~"/^e~5'^ (x"r(s-i)nx+2/"r(s-i)222/+2j/T(s-i)T^) 

The conditional distribution oi p x n matrix x given the q x n matrix y is 
also normal: 

(det27rQ)-"/2e-5^""'(^-^^)(^-^^)^ (5.3.1) 

with 

Q = En - Ei2E2-2'E2i = diag(l - p?, . . . , 1 - pj) 
P = E12E22''". 

Then the maximum likelihood estimates of the pi satisfy the determinantal 
equation 

det(5fi^5i2^2"2^^i'^2 - r^I) = 0, (5.3.2) 

corresponding to 

5*11 5*12 \ f 
SJ2 S22 J ' V yy^ 

where Sij are the associated submatrices of the sample covariance matrix S. 

Remark: The can also be viewed as = cos^j , where the 9i, Op are the 
critical angles between two planes in M": 

(i) a ]?- dimensional plane = span {(xn, ...,a;„i), (xip, ...,a;„p)} 

(ii) a g-dimensional plane = span {(|/ii, ...,yni)^, ■-, {Viq, ■■■.Vnq)}- 

As we shall see, Zi — rf — cos^ 6i are the precise variables Zi appearing in 
section 1. 

Since the (g, n)-matrix y has rank(|/) = g, there exists a matrix e 0{n) 
such that = (1/1 O); therefore acting on x with iJ^j leads to 



q n—q q n—q 



yi/ = {yi I O) I g, = (« U) I p . (5.3.3) 
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With this in mind, 

S12S22SJ2 ~ ^"^Sii 

= xH{yHy {yH{yHy)~^yH{xHy - r^{xH){xHy 

= uvJ — r^{uu^ + vv~^), 

and so the equation (5.3.2) for the can be rewritten 

det{uu^ - r'^iuu^ + vv'^) = 0. (5.3.4) 

Then setting the forms (5.3.3) of x and y in the conditional distribution 
(5.3.1) of X given y, one computes the following, setting H :— Hn, 

Trn-\x - Py)(x- Pyy 

= Ti n-\xH - PyH){xH - PyHy 

= Trfi-i {{u I v) - P{y, \ O)) {{u \ v) - P{y^ \ O))^ 

= i:in-^{u- Pyi){u- Pyi)^ + i:iVt-^vv^- 1] = diag(l - p?, . . . , 1 - pj) 

this establishes the independence of the normal distributions u and given 
the matrix y, with 

u = N{Pyi, Q), v = N{0, Q). P = diag(pi, ... ,Pp). 

Hence uu^ and vv'^ are conditionally independent and both Wishart dis- 
tributed; to be precise: 

• The p X p matrices vv'^ are Wishart distributed, given y, with n — q 
degrees of freedom and covariance Q; 

• The p X p matrices uu'^ are non-centrally Wishart distributed, given y, 
with q degrees of freedom, with covariance ^2 and with non-centrality 
matrix 

^Py,yJP^n-\ 
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• The marginal distribution of the q x q matrices yy~^ are Wishart dis- 
tributed, with n degrees of freedom and covariance Iq, because the 
marginal distribution of y is normal with covariance Ig. 

To summarize, given the matrix y, the sample canonical correlation coef- 
ficients rf > . . . > are the roots of 

(r^ > . . . > Tp) = roots of det{xy~^ {yy~'^)^^yx~'^ — r'^xx'^) = 
= roots of det(uu^ — r'^{uu^ + vv^)) = 
= roots of det (■u-u^('U'U^ + vv^)^^ — r^J) = 0. 

Then one shows that, knowing uu^ and vv^ are Wishart and conditi- 
ionally independent, the conditional distribution of rf > . . . > r^, given the 
matrix y is given by 



2)xCxilPyy^ 

{q/2),C,{I,) |A|! 



^ {n/2),C,{lPyy^P^n-^) ^ 

2^ (n/o^.rJTA ixii ^^^^ ^' 



where^ 



d2 j- / 2 2\ -'^p('^/2) 

R = diag(ri, ... ,r ), c„,p,q - 



T,{q/2)T,{{n-q)/2)Tp{p/2y 



By taking the expectation with regard to y or, what is the same, by integrat- 
ing over the matrix yy~^ , which is Wishart distributed, we obtain: 



random variables with zero means and covariance matrix S 



Theorem 5.1 Let Xi, . . . , Xp,Yi, . . . ,Yq (p < q) be normally distributed 

^21 S22 

Ifpl, ... , Pp are the roots o/det(Ej~/Ei2E22^S^2~P^-^) = 0; ^^s'^ maximum 
likelihood estimates rf, . . . , from a sample of size n (n > p + q) are given 
by the roots of 

det{xy^ {yy^)^^yx^ — r'^xx'^) = 0. 

Setting 

Z := diag(zi, . . . , Zp) = diag(ri, ... , r^) and := diag(pi, ... , pj). 



^Cn,p,q is a different constant from (0.0.20). 
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the Zi — rl have the following density 

1 

CorollEiry 5.2 If pi — ... — — 0, then the joint density of the Zi — r| is 

given by the density appearing in the integral of Theorem 1.1, namely 

7r^^/^c„,,,,A,(z) f[ zf'^'\l - z,)(--^-^-'y'dz,. (5.3.6) 

i=l 

RdTiiark: As was shown here, the normal distribution over R leads to the 
density (5.3.6) for the Zi, which corresponds to the case (3 — 1/2 for (1.0.3). 
Starting with normal distributions over C and H leads, in a similar way, to 
integrals (1.0.3) for the cases 13=1 and 2. 

6 Differential equations for the Grassmannian 
integrals and the hypergeometric functions 

6.1 Differential equations for the Grassmannian inte- 
gral 

Theorem 6.1 shows that the integral over the Grassmannian Gr(p, F") sat- 
isfies Painleve-like differential equations; for /9 = 1, this equation is the 
Painleve V equation with a specific boundary condition (Theorem 6.2). 

Theorem 6.1 The following holds for the integral 

Ip{y) = [ e^TV(/+ztz)-i det(ZtZ)-^(«-f)d//(Z) = c'exp /"^ H{y)dy, 

(6.1.1) 
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where H{y) — ^log/p(y) satisfies the differential equation (H' :— ^ and 
remember — 1 for (3 — 1 and — otherwise): 

4 [y^H'" + Qy^H'^ + (1 + 6^){2y^H" + Ay^HH' + yH^)^ - yPoH' + PiH + P2 



r 0, 



3 P{p- 1) 3 V2V2 



16(p + l)(p + 2)^ /2 ' 
3 P ^3 Ip-ilp+i 



for P — 1, (Painleve V) 



for P = 1/2, 



162p+l' 



72 



where 



for 13 = 2, 



Pi 



(6.1.2) 



/9=1 
/3=l/2 
/3 = 2 

m terms of 



4|/2 — 8s|/ + — 8 4:{sy — ri^) 4r{y + n) 

4y^ - 4sy + (n + 2)2 - 8 2s7/ - n(n - 2) r(27/ + n - 2) 
y2 -4sy + 4((n- 1)2 -2) 2sy - 4n(n + 1) r(2y + 4(n + l)) 



r = pg, s = n — 2p — 2q. 



(6.1.3) 



Theorem 6.2 For /3 = 1, we have 

(c« J-^ / e^TV(.+ztz)- det(ZtZ)-(^--)d/.(Z) 

/ 



(q-j)l ^ IT 

1 l>p{n~p) 



\ 



^{A2.}(A) n ^ 



V 



(«,i)SA 
n—q<j<n- 



r>0 fvGY. 



1 (p)K(g)« 



(/i'')2 (n)« 
= 2F^''{p,q;n;y) 



exp 



y 



dy 



(6.1.4) 
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where u{x) is the unique solution to the initial value problem: 
x^u'" + xu" + Qxu'"^ - Auu' + AQu' - 2Q'u + 2R = 



(Painleve V) 

with u(x) = pin — p) X + . . . + +an+ix'^^^ + Oix^^"^) + . . . , near x = Q. 

n 

(6.1.5) 

with a„+i specified (see remark). Q and R are polynomials in x: 

4Q = -x'^ j^2{n + 2{p-q))x-{n-2pf 

2R = p{p- q){x + n-2p). (6.1.6) 

The third order equation (6.1.5) has a first integral, which is second order 
in u and quadratic in u" , 

y!'-^ + ^ ([xv!'' + Qv! + R)v! - {y!'' + Q'v! + Rl)u + \q"u^ - t^^^_lt \ ^ o 

(6.1.7) 

Remark: Note that the Painleve equation (6.1.5) admits a solution 

nix) = pin — p) — — ^x + ^S^QiX^ (6.1.8) 

i>2 

with the Qi given by the indicial equation 

i{i - 1 - n){i - 1 + n)ai = gi{ao, ■ ■ ■ , a^-i), i = 1,2,. . . , 

showing the existence of a free parameter at i = n + 1. However the fact 
that, according to (4.2.3), 



uix)=p{n-p)-px + x-log}_^x 2^7^11 (n-i + l)^. 



p 

r>0 .... (h^y - ^ + 

leads to an explicitly known value for a„+i. 
Proof of Theorem 6.1: Define 

i,{t):= [ \A,{y)\''^fle^^=^'^yHl-y,ni + y,)%,, (6.1.9) 
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and the locus 

(l:^{tx^x^ 0, all other ti = 0}. (6.1.10) 

Using (1.0.3), and setting a — P{n — q — p + 1) — l,b = (3{q — p + 1) — 1 in 
(6.1.9), the hnear change of variables yk :— 2zk — 1 leads to: 



2/3 



[-1,1]P 



n e^-^Hl - y.)^("-^-^+'^-^(l + 2/fc)^(^-^+')-Myfe 



'Gr(p,F") 



I g2.TV(7+ztz)-i det(ZtZ)-'^(''-^')d/.(Z) 

(6.1.11) 



with 



(6.1.12) 



According to the appendix, the integral Ip{t) satisfies the Virasoro con- 
straints (8.0.10), with a — p{n — g— p+1) — 1,6 = (3{q — p + 1) — 1, and 
thus 

bo ^ a - b ^ P{n - 2q), bi ^ a + b ^ P{n - 2p + 2) - 2, 

and 

(Ti = l3n, (T2 = l3{n - 1) + 1. 
These expressions and their first ti- and t2- derivatives, evaluated along 
the locus C read as follows: (-F(t) := Fp{t) := log Ip{t)) 



= 



ii— + ]Fp+p{bo-h) 



dti 



fdE, 



= 



dh L 



-p 
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= 



92 ^2 92 5 Q 

\ dF„ d^F„ 



92 92 



The five equations above form a (triangular) linear system in five unknowns 











d'F, 




d'F, 




d'F, 


dt2 


c 




c 


dtidt2 


c 


dtidts 


c 


dtl 



which can be expressed in terms of 



dFp d'^Fp d^Fp 



dt^ ' dtj ' dtl ' 
Setting ti — x and = dFn/ dx, these expressions are 



dt2 

dFn 



dts 



92F„ 



dtidt2 
d^F„. 



-l-^(a,F'n-p{ti-h)) 

1 (-2l3U{Fli + F'^) + 2(i2 - hoti + aiC72)F^ 
'■1 ^ 

+p{{ai - 2(72 - bi)ti + 2&oa2)) 
^ (-aitiF^' + (71 + 6op) 



^ (-2/3t2(F;" + 2F;f;' - F';^) + 2ii(t2 - 6oii + <7i(72 + 

+ 2(60^1 - 2(71 (72)F; + P((2(72 - (71 + 6l)tl - 4(72&o) 



2^3 



dtl 



i (((72 + 2/3)tiF;' + 2(ihF'^ + {2boti - 2(71(72 - al)F'^ 

tl V 



+p((-o-i + 2(72 + - 60(0-1 + 2(72))) • 



(6.1.13) 



From (8.0.2), it follows that 
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php{t) p even, /? = 1/2 
Ip{t) = ^ p\Tp{t) p arbitrary, /3 = 1 (6.1.14) 
p\T2p{t/2) p arbitrary, (3 — 2, 

where in all three cases Tp{t) is a r-function satisfying the KP and Pfaff-KP 
equations (8.0.4). Substituting (6.1.14) in the equation (8.0.4) and evaluating 
along the locus C leads to the following equations: 



-'\p+l)ip + 2) /2 (1 
for /?= 1/2,1, 

for (3 = 2. (6.1.15) 
Then, using F{t) — logIp{t), with Ip as in (6.1.9) and (6.1.11), 

H{y) := -^log/ e^T'-(^+^'^)"det(ZtZ)-/^(^-^')(i/.(Z) 

dy JGr{p,¥^) 

^ -fiog(ef2'/2 
dy 

[ \Ap{z)\"'f]ey''/\l - ;^,)'^("-«-^'+^)-i(l + Zkf^'i-P+'^-'dzk 

J[-i,i]p V 



1 /'S^',.,!/ 



,ygt^h,,0,0,...)+Pj ^^^^^^ 



satisfies the differential equation, upon substituting the derivative (6.1.13) 

dH 

dy ■ 



into (6.1.15) {E' := f ): 
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4 (^y^H'" + 6y^H''^ + (1 + 6^){2y^H" + Ay'^HH' + yH^)^ - yPoH' + P^H + 



r 0, 



3, 



3 Ip-2lp+2 



{p+l){p+2y 12 



3^3 Ip-llp+l 

-y 



for = 



for /? = 1/2, 



for /3 = 2, 



where Pq, -Pi, -P2 are polynomials in y, with coefficients depending on r = pq' 
and s = n — 2p — 2q, given by table (6.1.3). 
From 

it follows that for /3 = 1/2 and 2, 



P 
p 



thus establishing (6.1.2). 
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= 2 



-4 (2^)-^p+l (3^) 



Proof of Theorem 6.2: In particular, for /3 = 1, from Theorem 4.2, Theorem 
6.1 and (6.1.5), 



d 

u{x) := X— log£;^,p 



aa5m}(a) n ^ 



v 



(i,j)eA 
rt—q<.j<n—p 



d_ 

dx 



log f e-f^x^'("-^') / e^^(^+^^^)"'det(ZtZ)-(«-^')d//(Z)) 

= —px+p{n — p)+xH{x) 

satisfies the differential equation (6.1.5). From Ip{x) = 1 + —x + . . . , as in 
(4.2.6), it has the behavior near x = 0, spelled out in (6.1.5), namely 



u{x) = p{n — p) — — —X + . . . . 



n 



(6.1.17) 
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Equation (6.1.7) follows from Cosgrove and Scoufis (fT^) (see 0) , and 
the constant is obtained by setting a; = in the equation and using the 
Taylor expansion (6.1.16) of u{x) about a; = 0. ■ 



6.2 Hypergeometric functions, KP hierarchy and 
Painleve equations 

Theorem 6.3 The hypergeometric function for /5 = 1 and p,q,n G C, ex- 
pressed in ti-variahles, 



r{t) := 2Fi^\p,q;'n;u 



(6.2.1) 



satisfies the Hirota bilinear relation as a function of ti,t2, namely for all 
t,t' G C°°, 



T{t - [z-^])T{t' + [z-^])dz = 



In particular, T{t) satisfies the KP hierarch'i]^ (k = 0,1,2, ...): 

1 



d I d 1 d 



Proof: Using 
one has the following formula 



2 dtidtk+3 
1 — a = e ^1 i 



r o r = 0. 



(6.2.2) 



(6.2.3) 



i<j<p 

l<fc<m 



OC ""fc j 



n 

i<j<p 

l<fc<m 

i<j<P 
i<i<p 



(6.2.4) 



10 



Given a polynomial p(ti,<2, •••), define the customary Hirota symbol p{dt)f o g := 

The siS are the elementary Schur polynomials 



^'(alr'af;' ■•■)/(* + y)9{i - y) 

gEr := Yji>n Si{t)z^ and for later use, set Si{d) := Si{^, . 
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On the one hand, setting a — (3{n — q) and h = (3qi\i (2.3.14) using the 
new variables tj, using the constant (0.0.20) and (6.2.4), the hypergeometric 
function equals an integral for integer p > 1, 

cgL 2F[''^\(5p,(5q,(5n-u,,...,u^) 



= / n (l-^.«.)-^|A,(^)r^n^f^-^+^)-^(l-^,)^("-^-^+^)-M^, 

-'[0,l]f i<fe<p 

l<j<m 

= / |Ap(;^)|2me'^S>=i**^^4(^-^+')-'(l-z,)^(--*'-«+i)-Mz,. (6.2.5) 

i[0,l]f 

According to (8.0.4), this integral is a solution of the Hirota bihnear relation 

for /9 = 1, g, n G M, integer p > 1, such that q — p + 1 and n — p — g+l>0. 

On the other hand, the hypergeometric function, is also defined by (0.0.16), 
for p, g, n e M and (3=1: 

2Fi\p,q,n;ui,...,uJ _^ ^ = 5Z Wpr^A(t) =: J] CASA(t). 

(6.2.6) 

For integer p > 1, the integral (6.2.5) was shown to be a solution of the 
Hirota bilinear relations and so the coefficients 



integer p>i 



satisfy Pliicker relations. They are homogeneous quadratic relations in a 
finite number of the c^'s for A G Y and they characterize the KP r-functions; 
so, we have for every integer p > 1, 

= ^ ax^i^cxCfj,, ax,,j, = ±1 



Ha(?^)m 



where Xi{n,q) are polynomials in n,q and is a finite sum. Setting p = 
distinct integers pk in sufficient number, one solves the homogeneous linear 
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system (6.2.7) in the Xi{n,q) with coefficients p^, whose determinant is a 
Vandermonde, therefore non-zero. This implies that Xi{n,q) — for all 
n, g e M and so, we also have 

Xi{n, q)p' = 0, for all p e M, 

i 

implying the Pliicker relations (6.2.6) for p,q,n G M. One can extend the 
argument further by analytic continuation to p, g, n e C. ■ 



Theorem 6.4 The hypergeometric function 

H{x) = ^log 2Fi^\p,q;n;y) 



(6.2.8) 



satisfies the Painleve V equation: 

4 (x^H'" + Qx^H'^ + 2{2x^H" + Ax^HH' + xH^)^ - xP^H' + P^H + Pa = 

(6.2.9) 

with Po,Pi,P2 as in (6.1.3). 

Proof: For integer p, g, with < p, g < n/2, we have by (2.3.12), 



2F[^\p,q]n]u^ 



X 



ul=Sux 



r>l 



r>0 AeY, 



with br rational in p, q, n, and so 



d 



H{x) := ^log 2F[^\p,q,n-y) 



{h^y (n)A 



E y\=5ux r>0 

also with Cr rational in p, q, n. Putting this expression into the left hand side 
of the Painleve V equation leads to 

oo 

drx"^ ., dr = rational in p, q, n, with universal coefficients. 

r=0 

For integer p, g, n with n/2 > p, g > 0, i?(x) satisfies the Painleve equation 
by the proof of Theorem 6.1 and so = 0. Using the same argument as in 
Theorem 6.3, we have dr — for all p,q,n e C, whenever the series 
makes sense. ■ 
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7 Differential equations for the Wishart and 
canonical correlations distributions 

Remember from section 1, the tangent space to the Grassmannians at Id = 



O 



is given by 



TidGr{p, F") = {Z arbitrary {n ~ p) x p matrix with values in F}. 
The subgroup 

acts on Tjd Gr(m, F") as 

AdBiZ) = fi22^fin , 

for which the spectrum (Ai, . . . , Am) of is a invariant under the action 
of K and Aj > 0, since the matrix Z'^Z is positive definite. 
Given E C [0, 00), consider the following probability: 

PS(E) := P(allA.Gi5;) 

= c,n f e-^'^^^'^^d^{Z) 

P'^ J ZeTj^Gr{p,¥") '^^ ^ 

spectrum (ztz)cE 
J,p\A,{z)\^^Ut^e-''^-hf^-'^^'^-'dz, 
J,,\A,izW^UUe-yi-hf^-''^'^-'dz;^ 



(7.0.1) 



where dfi{Z) is Haar measure (1.0.11) on Ti^Grlp, F") and where (3 = 1/2, 1, 2 
correspond to M, C, EI respectively. The function e"^^^-* will be either 

(i) Laguerre: e'^^'^ = e-'^z", F = R+, p= 1/2, 1, 2 

(ii) Jacobi: e'^^^) = (1 - z^z'', F=[0,1], p = 1/2, 1, 2 

(iii) Gaussian: e'^^^) = e"^", F = R+ , p = 1, n = 2p 

We shall only consider (i) and (ii), the Gaussian distribution being as in 

I2II. 



58 



Adler-vanMoerbekeiGrassmannian October 8, 2001 §7, p. 59 



7.1 Laguerre probability on the tangent space to Gr(p, F*^) 
and the Wishart distribution 

Theorem 7.1 For e'^^^/) ■- e'^y", the probability (7.0.1) defined on Z e 
the tangent space (at the identity) to the symmetric space Gr{p,¥"'), leads to 
the following probability on the (positive) spectrum (Ai, . . . , Xp) of Z^Z, 

P(2(max A,<a:) := c,,^ J e-'-^''^ {det Z^ Zyd^^iZ) 

spectrum (ztz)c[o,x] 
/f,^^,, \A,{y)\-^ e-'^^yr'^-'^^^'-'dy 



(7.1.1) 



f{x) = x-^logPjJ)(max Xi < x) 



'[0,oo) 

Then 

f(T\ — .7, 

dx 
satisfies 

• for 13 = 1: 

x^f" + xf" + 6x/'^ -^f f - W - h'f) = (Painleve V) 

with 



2h = {a + n — 2p — hxY — Anhx. 



for(5 = [ 



1/2 . 
2 • 



p(/3) p(/3) 

4 2 -1 , 4 ,2 



b^X^ 



(7.1.2) 

where 

a:=a + (3{n -2p) + (3 - I, 
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and 



Q 

Q2 
Qi 



-p{p-l){p + 2a){p + 2a + l), for P=l/2 



-p(2p+l)(2p + a)(2p + a-l), for (3 = 2 
a 



W - + 6ap + 4(1 - I3)a 



a 



= 2Pp' + 2ap + {l-P)a, Qo=p+—. 



(7.1.3) 



Corollary 7.2 Consider a matrix A, which is Wishart Wp{n, ^Ip)- distributed 
with eigenvalues Ai, . . . , Xp (see Muirhead, p. 107). Then 



d 



satisfy the equation 

q / pw pw 



-p{p - l)n{n - 1) 



n,pJ 



with 



x'f" - x'f + 6x'f" - x{8f + (bx - Qof - AQ,)f', 



Qq = l^in + p - I) and 4Qi = (n - l)(4p + 1) - p, 



Proof: According to (5.1.2), the Wishart distribution Wp{n, ^Ip) of the pxp 
matrix A is given by 

P„^(dA) = rp(n/2)-^6"^'/2e-''^^(detA)5("-f-i) dAij. (7.1.4) 

l<i<j<p 

and so the joint probability P^(maxj \ < x) is precisely formula (7.1.1) , 



with a =p/2 and /3 = 1/2, 

Pn,p{^^ \<X) = Cn,p,l 



= PS(max A, <a;) 



=p/2 
= 1/2 
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Therefore P^p{x) also satisfies tfie inductive differential equation (7.1.2); we 
only need to check that 

r ^^"^ 1 

J[0,x\P±^ {' 



p±2 

,P/2,i({"±4)-2(p±2))-| 



Q,Qo,Qi,Q2 can immediately be computed by setting (3 = 1/2, 5^ = 1 and 
2a = n-p- 1 in (7.1.3). ■ 



7.2 Jacobi probability on the tangent space to Gr{p, F"^) 
and the sample canonical correlation distribution 

Theorem 7.3 For e~^^^^ := (1 — y)'^y'^, the probability (7.0.1) defined on 
Z G the tangent space (at the identity) to the symmetric space Gr(p, F"-), 
leads to the following probability on the (positive) spectrum of Z^Z, 

Pf)(maxA,<^) := / e'^^'^^'^^dfiiZ) 

n,p\ ^ I — 2 / ZeTj^Gr(p,¥n) / 

spectrum (ztz)c[o,£^] 

/^,^^,, \A,{y) nr=i(i - y^ry■^'^''-''^'^-'dy, 

/[o,!]. |A,(z/)P^nLi(i - y^ry■^'^''-''^'^-'dm ' 

(7.2.1) 

satisfies the following differential equations, upon setting 
fix) (1 -x^)^iogp(5(maxA, < ^), 
• for P = I: 

2{x^ - Iff" + 4(x2 - 1) {xf" - 3/'') + (I6x/ - ii(x2 - 1) - 2sx - r) f 
-f(Af-ux-s)^0 (7.2.2) 
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= 4(w + l){x^ - 1)2 (-u(a;2 - 1)/"' + (12/ - ux - 3s) f" + 6u{u - 1)/'^) 

-(a;^ - l)/'(24/(u + 3)(2/ - s) + 8fu{5u - l)x - u{u + l){ux^ + 2sx + 8) + Q2) 
+/(48/^ + 48f{ux + 2x-s) + 2f (Su^a;^ + 2ux'^ - 12usx - 2Asx + Qi) 

— u{u + l)x{'iux^ + sx — 2ux — 3m) + Q^x — Qisj , (7.2.3) 

where 



bo ^ a - b - P{n - 2p + 1) + 1, 61 = a + 6 + - 2p + 1) - 1 
r = |(&o + {bi + 2- 2(5f) s = |6o(&i + 2-2(5) 
u^^{2Pp + bi + 2-2(3){2(3p + b,), 

Q = ^ ((s^ -ur + u^f - 4(rs^ - Aus^ - As^ + wV)) 

= - Swr - 6r + 2?/^ + 23ii + 24 
g2 = 3ms^ + -Au^r + 2ur + Au^ + IQu^ 

Q4 = 9s2_3^^_g^^^2^22M + 24 = Qi + (6s^-'u2-'u). 

(7.2.4) 



Corollary 7.4 Let A = ZZ^ have the Wishart distribution Wp+qln,!!)- 
distribution. Break up the matrices S and A, as follows: 

P Q 

< — > < — > 

/En Sl2^ Ip a=( ^ Xp 
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Assume the eigenvalues pf,--- , Pp o/E7i^Ei2X122'^E72 all zero. Then the prob- 
ability distribution of the eigenvalues rf, . . . ,rp of AiiAi2A2iAj2 (sample 
canonical correlation coefficients) is given by 

Pn,pA^) = no <^'<^ forl<i<p) 

^ /[o,i±.]. |A,(^)| nLi z!^-''"^^ - z,Y^i--^-^-'Mz, 

and 

fix) := (l-x^)^logP„^,» 
satisfy the inductive PDE: 

^ I Pn+A,p+2,q+2Pn-A,p-2,q-2 _ . ] _ f samc cxprcssion as the 
^ \ pc^^ J ~ \ right hand side of (7.2.3) 

with Qi,Q2,Q3 being symmetric polynomials ofp,q, given by (7.2.4), where 

u — n{n — 2) 

r/4 = nV2-n(p + g)+p^ + ?2 

s — {n — 2p) [n — 2q) 

Q = A8pq{p-l){q-l){n-p){n-q){n-p-l){n-q-l). 

Remark: For instance 

Qi = 24(n-l)(p^ + g^-n(p + g))+48pg(n^-n(p + g)+p?) 
-(n-6) (n-lf(n + A). 

Proof: The proof follows immediately from Corollary 5.2 and Theorem 7.3. ■ 

8 Appendix: The Pfaff-KP hierarchy and Vi- 
rasoro constraints 

Consider weights of the form p[z)dz := e^^^^Mz on an interval F = [A, B] C 
M, with rational logarithmic derivative and subjected to the following bound- 
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ary conditions: 

-P-^V'^^-^ \^^J{z)p{z)z'' = for all k > 0, (8.0.1) 

Theorem 8.1 The multiple integrals 

I,{t;(3) = / \A,{z)\'^f\(e^^'^^lp{zk)dz,), for p>0 

k=i ^ ^ 
p\Tp{t) p even, /3 = 1/2 
p\Tp{t) p arbitrary, /3 = 1 (8.0.2) 
p\T2p{t/2) p arbitrary, (5 — 2 

with Iq — 1, satisfy 

(i) the following Virasoro constraints for all k > —1: 

E ^4'i,p(^,P) - b^ ^4'i+i,,(i,p)) Ip{t; = 0, (8.0.3) 

in terms of the coefficients ai, h,i of the rational function (— logp)' 
(a) The Pfaff-KP hierarchy: (see footnote 14 for notation) 

f ~ 1 \ 

\Sk+A{d) - 2 Qt^Qtk+z ) Sl)Sk{d) Tp+2 O Tp_2 (8.0.4) 

p even, k = 0,1, 2, ... . 

of which the first equation reads (p even) 

= 12^t?^(i _ i^j. (8.0.5) 

(Hi) More generally, the functions T(t) satisfy the Hirota bilinear relations for 
allt,t' e C°° andm,p positive integers (see footnote 10 for notation) 

• (3^1 

/ Tp{t - [z-^])Tp{t' + [z-^])e^^^''-'>'dz = 0, (8.0.6) 

J z=oo 
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• /3 = 1/2 and 2 

J z=oo 

+ <f T2p+2(i + W)T2^(i'- Wje^S^^^^-^'^^-'^^^-^'^ci^^O, (8.0.7) 

Jz=0 

Example (Jacobi /^-integral) 

This case is particularly important, because it covers the integrals in The- 
orems 0.2 and 0.3. The weight and the Oj and 6j, as in (8.0.1), are given 

:= = (1 -.)"(! + V' = j^ a-bHa^b)z 

qq — l,ai — 0, a2 — —1, bo — a — b, bi — a + b, and all other a^, bi — 0. 
The integrals 

/,= / \A,{z)f^fl{l-z,r{l + Zk)'e^^=^'^'idzk (8.0.8) 
•^^^ k=i 

satisfy the Virasoro constraints {k > —1): 

= ( "Jfi,, - "JS + "Ji'i, + fi "Jl'i J = 0. (8.0.9) 

Introducing cTj = (2p — i — 1)(3 + i + 1 + hi, 

Then introducing the function Fp := log rp(t), the two first Virasoro con- 
straints for m = 1, 2 divided by Tp are given by 

(8.0.10) 
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